Efficiently computing runs on a trie

نویسندگان

چکیده

A maximal repetition, or run, in a string, is periodic substring whose smallest period at most half the length of substring. In this paper, we consider runs that correspond to path on trie, other words, rooted edge-labeled tree where each edge labeled with single symbol, and endpoints must be descendant/ancestor other. For trie n edges, show number less than n. We also an asymptotic lower bound maximum density tries: limn→∞⁡ρT(n)/n>0.9932348 ρT(n) edges. Furthermore, O(nlog⁡log⁡n) time O(n) space algorithm for finding all runs.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Computing Runs on a General Alphabet

We describe a RAM algorithm computing all runs (=maximal repetitions) of a given string of length n over a general ordered alphabet in O(n log 2 3 n) time and linear space. Our algorithm outperforms all known solutions working in Θ(n log σ) time provided σ = n, where σ is the number of distinct letters in the input string. We conjecture that there exists a linear time RAM algorithm finding all ...

متن کامل

Efficiently Computing a Good Segmentation

This paper is concerned with the problem of segmenting an image into regions, using a local measure of the difference between image pixels. We develop a general framework for a broad range of segmentation problems, based on pairwise comparison of regions in a segmentation. This framework provides precise definitions of when a segmentation is too coarse or too fine. Within this framework, we def...

متن کامل

Computing Congruences Efficiently

For a fixed similarity type which is at least binary, we show there is a linear time algorithm for computing the principal congruence generated by a pair of elements. Let A be a finite algebra and let θ be a partition of A. In this note we give a simple algorithm for computing the smallest congruence containing θ and discuss its efficiency. We begin with the case that θ only contains one nontri...

متن کامل

Computing Iceberg Queries Efficiently

Many applications compute aggregate functions over an attribute (or set of attributes) to find aggregate values above some specified threshold. We call such queries iceberg queries, because the number of abovethreshold results is often very small (the tip of an iceberg), relative to the large amount of input data (the iceberg). Such iceberg queries are common in many applications, including dat...

متن کامل

Computing alignment plots efficiently

Dot plots are a standard method for local comparison of biological sequences. In a dot plot, a substring to substring distance is computed for all pairs of fixed-size windows in the input strings. Commonly, the Hamming distance is used since it can be computed in linear time. However, the Hamming distance is a rather crude measure of string similarity, and using an alignment-based edit distance...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Theoretical Computer Science

سال: 2021

ISSN: ['1879-2294', '0304-3975']

DOI: https://doi.org/10.1016/j.tcs.2021.07.011